About

Overview

The cran-search project aims to provide a database to perform a package search for the R programming language in the Comprehensive R Archive Network (CRAN) repository. The collected data are obtained by the tools::CRAN_package_db() function and selected only a few columns to perform the search for the topic of interest.

In the following table, it is possible to verify a brief structure of the data frame collected with packages available in CRAN. For example, the number of rows and columns, and the frequency of words longer than 3 or 4 characters for the column named title, description, and license. A depth investigation of the data is at the discretion of the reader.

update structure information
2025-01-10 column update, package, version, license, title, description, date, depends, import, url
2025-01-10 n_column 10
2025-01-10 n_row 21882
2025-01-10 NA TRUE
2025-01-10 title frequency: (1) data 3609 (47.97%), (2) analysis 2245 (29.84%), (3) with 1670 (22.20%)
2025-01-10 description frequency: (1) data 14621 (42.21%), (2) with 10021 (28.93%), (3) package 9997 (28.86%)
2025-01-10 license frequency: (1) license 6082 (50.15%), (2) file 5566 (45.90%), (3) apache 479 (3.95%)

Author

Author
name url
author Bruno Faria
website https://brunofariadf.github.io/
github https://github.com/brunofariadf/
Project
name url
main cran-search
review News
license MIT